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ABSTRACT 

Space missions such as Kepler and CoRoT have led to large numbers of eclipse or transit measurements in nearly continuous time 
series. This paper shows how to obtain the period error in such measurements from a basic linear least-squares fit, and how to correctly 
derive the timing error in the prediction of future transit or eclipse events. Assuming strict periodicity, a formula for the period error 
of such time series is derived: CTp = ctj- (12/(V^ - where <Tp is the period error; the timing error of a single measurement 

and N the number of measurements. Relative to the iterative method for period error estimation by Mighell & Plavchan (2013), this 
much simpler formula leads to smaller period errors, whose correctness has been verified through simulations. For the prediction of 
times of future periodic events, the usual linear ephemeris where epoch errors are quoted for the first time measurement, are prone to 
overestimation of the error of that prediction. This may be avoided by a correction for the duration of the time series. An alternative is 
the derivation of ephemerides whose reference epoch and epoch error are given for the centre of the time series. For long continuous 
or near-continuous time series whose acquisition is completed, such central epochs should be the preferred way for the quotation of 
linear ephemerides. While this work was motivated from the analysis of eclipse timing measures in space-based light curves, it should 
be applicable to any other problem with an uninterrupted sequence of discrete timings for which the determination of a zero point, of 
a constant period and of the associated errors is needed. 
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1. Motivation and objectives 

The space missions MOST, Kepler and CoRoT have been ded¬ 
icated to the acquisition of near-continuous photometry over 
longer time scales. From them, timing measurements of eclipse 
or transit events have become available of a different nature from 
those from ground-based campaigns. Their main difference is the 
completeness of coverage between the first and the last measure¬ 
ment, with duty cycles of about 90% (Michel 2013, for Kepler 
and CoRoT) over time scales ranging from weeks to years. The 
derivation of precise ephemerides for timing measurements of 
periodic events (typically eclipses or transits) in such data may 
therefore also require revised methods. In this paper, we ad¬ 
dress two points: the estimation of the period error on the one 
hand and the error in the prediction of the time of future eclipse 
events - also denominated the ‘prediction error’ - on the other. 
The derivation of the prediction error is related to the correct us¬ 
age of the epoch error of an ephemeris. This will lead also to 
a recommendation for an improved quoting of ephemeris from 
such long-cadence time series. In all the work presented here, an 
intrinsically constant period of the observed target is assumed. 
An algorithmic method to derive period errors from continuous 
series of timing measurements has been published by Mighell 
& Plavchan (2013, furthermore M&P) In the first part (Sect. 2) 
of this communication, it is shown that a linear fit of the mea¬ 
sured individual transit or eclipse timings (or of ’O-C’ residuals 
derived against a preliminary ephemeris) is the correct way for a 
determination of a period and its error, leading to a very simple 
equation to estimate the period error. The epoch error in a linear 


ephemeris is elaborated in the second part (Sect. 3). This error, if 
quoted as usual for the first timing measurement in a dataset, is 
shown to be a non-optimum description of the zero-point error in 
a linear ephemeris, which may lead to overestimated prediction 
errors. Correct ways to estimate the prediction error of future 
events are then given. 


2. Derivation of the period error 

Our objective is the estimation of the error of the period P, given 
a continuous sequence of N timing measurements Tg at integer 
Epochs E, with E - Q,...,N- 1, and assuming that the period 
to be measured is intrinsically constant (e.g. P does not vary 
with E). It is also assumed that all timing measurements have an 
identical time error ctt- A linear ephemeris given by 

T,,e^P-E + T,p, ( 1 ) 

Tcfi being the time of zero-epoch, can then be derived from mini¬ 
mizing the residuals Te-Tc,e- We note that Te-Tc,e corresponds 
to the commonly used (O-C) or ‘observed - calculated’ residuals. 

The minimization to determine best-fit parameters for P 
and Tcfi is then given by the linear regression: 


A 


Z 

£=0 


(Te -{PE + Tcfi)f 


ert 


( 2 ) 


The least squares estimate of the slope b of a linear fit y = a H- 
bx to data-tuples {xi,yi) can be found in many basic works on 
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statistics (e.g. Kenney & Keeping 1962; Press et al. 1992) and is 
given by: 

where we use summation^ over i - 0,... ,N - 1. Recognizing 
that the values jc,- are given by the Epoch number E, and changing 
to the nomenclature of Eq.|^(a ^ Tcfi, b ^ P, y, ^ Te) and 
for the convenience of writing, replacing E by i, we obtain: 

(4) 

It is of note that the Tj in Eq. |^may be either the measured 
times themselves (quoted for example in BID) or O-C’ residuals 
against some other (preliminary) ephemeris. In that case, Eq. 
delivers the difference to the period of that ephemeris. 

Making use of the identities i - N{N - l)/2 and 
- N{N - l)i2N - l)/6 , we find, after some basic al¬ 
gebra: 


P = 


12 2 Ti (/ - 

N^-N 


cnJ] TiWi ; CN = 12/(iV3 _ ( 5 ^ 


The terms w, = i - acts as weighting coefficients for the 
timings T,, the highest weight being given for the timings at the 
beginning and end of a dataset, and with little or no weight for 
those near the centre. 

Erom above equation for P - P(7’o,..., r^_i) we can then 
derive the period error crp using error propagation; e.g. cr^ = 
Zi (‘Xr§f:)^, which leads immediately to: 


■Z' 


(6) 


Using again the identities for Zjlo* ‘ 2/Io* 


2 W; = 1/cjv and arrive at the final result: 


2 2 

(J~ p — O' Y — 


120-2 

N^-N 


\N>2. 


(7) 


2.1. Comparison with the period errors of Migheii & Piavchan 

In the following, period error estimates from Eq.[7]are compared 
to similar ones given by M&P. They use an iterative algorithm, 
denominated ‘Period Error Calculator (PEC)’ that obtains the 
period error through multiple combinations of the errors of the 
manifold 2-point measurements that are present within a series 
of timing measures. Eor = 2 to 8 timing measurements, M&P 
quote explicit value^that correspond to the coefficients of 
Eqs0 or|^given above. A comparison of these values is shown 
in Table 1. Their results agree with Eq. only for the case of 
N = 2 or N - 3. Up to A = 8, differences remain small within 
=s30%. Without implementing their PEC algorithm, we can also 
compare with their example of a strictly periodic variable with 
A = 171 timing measurements, each with an uncertainty of 
<tt = 0.0104 days, for which they derive a period error of 23 
microdaysErom our Eq.|^ with cn\ - 2.40 ■ 10“®, we derive, 

* Usually, summations over indices going from 1 to A are assumed in 
Eq.|^and also in Eq.[^ The change to indices going from 0 to A - 1 has 
no consequences as long as the summations go over a total of A terms. 
We prefer here indices starting with 0 in order to start with E=0 in the 
linear ephemeris, as in EquationfT] 

2 See the ‘Reduced’ values in me table 1 of M&P. They give them 
for M = 1,..,7 period cycles, which correspond to A = 2,..,8 timing 
measurements 

^ See M&P’s Figure 1 and accompanying text, which is for 170 cycles, 
corresponding to 171 measurements. 
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Table 1. Comparison between values of this work and of Mighell & 
Piavchan 


A« 

This work 

Cn 

M&P 

Cn'’ 

2 

2 

2 

3 

0.5 

0.5 

4 

0.2 

0.22222 

5 

0.1 

0.11806 

6 

0.05714 

0.07125 

7 

0.03571 

0.04574 

8 

0.02381 

0.03163 

171 

2.40x10'® 

4.89x10'® 


Notes. A corresponds to M + 1 in M&P. Corresponding to the 
‘Reduced’ square of the period error estimates from M&P’s table 1. 


however, a period error of 16.1 microdays, which implies that 
their ‘reduced value’ for N - 171 is (23/16.1)2 = 2.04 times 
larger than cm. 


2.2. Simuiations of the period error and appiication notes 

Given that period error estimates from Eq.|7]deviate significantly 
from those of the algorithm by M&P, the results of Eq. [7] were 
verified by a set of simulations as follows. Assuming that an in¬ 
trinsic (correct) ephemeris is given by TdE) - Tcp + E Pc, a set 
of A timing measurements for E = 0,.., A - 1 is generated, with 
values Te that are randomly drawn from a normal distribution 
with a standard-deviation of ctj- and centred on the calculated 
value TciE). The error of each individual timing measurement is 
also set to be (Tt- An example of such a simulated set of mea¬ 
surements in the form of an O-C diagram against the intrinsic 
ephemeris is shown in Eigure [1] A linear ephemeris is then fit¬ 
ted by minimizing4f2 (e.g. as given by Eq. ^or Eq. [^, which 
gives us the ‘observed period’ Pfit, and the deviation against the 
intrinsic period, AP - Pfjt - Pc. This procedure can easily be re¬ 
peated many times using the same intrinsic ephemeris and a his¬ 
togram of the deviations AP be produced, as shown in Eigure ^ 
The simulations show that, for large numbers of repetitions, the 
mean of AP becomes close to zero, and the standard deviation 
of AP very closely approaches the value predicted by Eq[^ Such 
simulations with 100 000 repetitions were performed for values 
of A = 3, 10, 100, 1000, all of them showing the validity of the 
period error given by Equation |7] 

Equation [7] assumes that the measured times are normally 
distributed around the intrinsic (and unknown) linear ephemeris, 
and that the errors of the individual timing measurements (which 
are usually assigned by the observer) are of a size similar to the 
(O-C) residuals of the timings against this ephemeris. When this 
condition is not given, the reduced chi-square (using Eq. and 
Tred “ IN) against the intrinsic ephemeris deviates signifi¬ 
cantly from 1. In practice, we know only the fitted ephemeris. 
The reduced chi-square against it should be calculated as ~ 
UN - 2), accounting for the two unknown fit parameters. If 
Xj^^ deviates substantially from 1, there is in principle no way 
to know if this is due to unusually large or small random er¬ 
rors in the measurements or if it has other origins. Eor the case 
°^Tred ^ E we don’t know if measurements have been fortu¬ 
itously well aligned, or if measurement errors have been over¬ 
stated. Eor;^^2^j » 1, measurement errors might have been un- 
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Fig. 1. An ‘O-C’-style diagram of a simulation of N = 10 timing mea¬ 
surements drawn from a normal distribution with a standard deviation 
of CTj- = 1 against the intrinsic ephemeris, with individual errors also of 
±1. The time unit is irrelevant in these simulations and any unit can be 
assumed. The intrinsic (correct) ephemeris would be a horizontal line at 
O - C = 0 (not shown). The solid line shows the best fit to these mea¬ 
surements. The period error, given by Eq.|^ which is ±0.1101 time units 
per epoch, is indicated by the slopes of the dashed lines. In this partic¬ 
ular simulation, the reduced chi-square against the intrinsic ephemeris 
was 0.64, whereas the reduced chi-square against the fit was 0.75. 



A P 


Fig. 2. Histogram of differences AP between fitted and intrinsic periods 
generated by repeating 100 000 times a simulation for N = 10, as shown 
in Figure^ The standard deviation of AP is 0.1099, very close to the 
value from Eq. 1^(0. 1101 for A = 10). 


derstated. However, an intrinsic period-variation could also be 
the origin for the poor fit - a revision of the physical condition 
of the observed system should then be performed, to evaluate if 
that might be a valid hypothesis. Even if the fitted ephemeris in¬ 
dicates a good fit With 1, we have to be aware that a large 

against the unknown true ephemeris may still be present, and 
that the fitted parameters may have errors larger than expected. 
This can happen due to measurements being fortuitously aligned 
to represent a rather deviant ephemeris. A simulation with such 
an outcome in shown in Fig. 



Fig. 3. Like Fig. In this particular simulation, a fortuitously good 
fit is obtained from data with a rather large scatter; The reduced chi- 
square against the intrinsic ephemeris was 1.43, whereas the reduced 
chi-square against the ephemeris fit (solid line) is 0.97. However, the 
fit’s period has a rather large error, being 2.5 times as large as the period- 
error given by Eq.|^ (dashed lines) 

3. Epoch and prediction errors 

In a similar way to that discussed for the period and period error 
in the previous section, we can derive the intercept of the linear 
fit, which gives the ephemeris zero epoch Tcfi and its error. 


3.1. Epoch errors at the beginning of a measurement 
sequence 

First, we continue to use the usual epoch indices ranging from 0 
to A - 1 and start from a common equation' for the intercept: 


a - 


ZyiZxf-Z Xi Z xiyi 
NZxj -(Z Xi)^ 


( 8 ) 


Changing, as before, to the nomenclature of Eq.j^and with very 
similar algebra, we obtain: 


5 y 7’. ( zw-i _ 

^-.0 = -= 6/(A2 + N). (9) 


Again, terms v, = - i act as weighting coefficients for the 

timings T,, with a weight that goes from about |A through zero 
to -^N. For the error of we obtain similarly to Eq.j^ 

= 4o-r y (10) 

Evaluating the sum of the weighting coefficients as 
Z’^j — N(N + 1)(2A - 1)/18, we then obtain: 


(4A - 2) 0-2 
N^+N 


( 11 ) 


3.2. Ephemeris with zero epoch at the centre of the 
measurement sequence 

In the following, a zero epoch at the centre of the measurement 
sequence is considered. For simplicity, only the case with an odd 
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number of measurements is elaborated. The indices (epochs) of 
the measurements are now labelled j and will be j - -k,... ,k 
with k = ^{N - 1). For the period, we start again from the basic 
Eqs. I^or rather]^ For the summations, going now from -k to 
+k, we employ the identities Ti)=-k J — ^ ^^d T!j=-k + 

1)(A^ - 1)/12 . The value for the period then turns out as: 


^ 121 : Tj j 

N^-N ' 


( 12 ) 


Given that j - i - this is identical to Equation|^ The equa¬ 
tion for the period etTor is then of course also identical to Equa¬ 
tion [7] Eor the intercept, or zero-epoch, and starting from Eq. 
however, we obtain a different and much simpler expression: 


1 

N 



(13) 


This ‘centre’ or ‘middle epoch’ has been labelled Tc^m in order 
to distinguish it from the usual zero epoch, Tcp, at the first mea¬ 
surement. The corresponding error is given by: 



(14) 


The different outcome for the epoch eiTor, depending on its loca¬ 
tion at the beginning or in the centre of a measurement sequence, 
can be explained in the following way. Erom the epoch error at 
the centre of a measurement sequence (Tt.^ and from the pe¬ 
riod error crp, we can can calculate the expected timing error at 
the beginning (or at the end) of the sequence, using the 

square-sum of eiTors: 

(15) 


where is the number of periods between the sequence’s be¬ 
ginning and its centre. Inserting the expressions for ctj-^ and crp 
into Eq. we find that with being given 

by Equation This means that the error of the zero epoch of 
an ephemeris quoted in the conventional way, with E - 0 cor¬ 
responding to the first timed event, is really the error-sum of the 
‘true’ epoch error ctj-^ in the middle of the sequence plus a con¬ 
tribution from the period error. 


3.3. Consequences for the prediction of events beyond the 
measurement sequence 


A fundamental function of an ephemeris is the prediction of tran¬ 
sit or eclipse events beyond the end of a measurement sequence, 
giving both the time and the time uncertainty of such future 
events. The usually quoted crj^g and crp may, however, easily 
lead to an overestimation of this timing error if a naive error- 
sum given by cr^ £ = crj^ + (E crp)^ is used, as is illustrated 
by the dashed slopes in Eigure|^ There are two solutions to cir¬ 
cumvent such overestimation of the ‘prediction error’. The first 
solution, can be retrievecpifrom the conventionally quoted 
epoch errors by reversing Eq. [T5| 




(16) 


As a side-note to Eq. the calculation of cr-p^^^ for existing 
ephemerides may also serve as a diagnostics on the correct sizing of 
these ephemeris errors since crp^,,^ can easily be related to the size of 
the errors of individual timing measurements through Eq. [T^ and, of 
course, cr^ needs to come out as a positive number - or else crp^g is 
underestimated and/or crp is overestimated. 
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Fig. 4. Expected timing uncertainties during and after a sequence of 
timing measurements based on an ephemeris with given errors in epoch 
and period. The axes are similar to those in Figure [T] The dashed line 
gives the development of the 1-sigma uncertainty from an error sum 
using the period error and the epoch error at the beginning of the se¬ 
quence. The solid lines outline the correct timing uncertainty, with the 
epoch error being derived for the centre of the sequence. For epochs 
beyond the end of the sequence, this uncertainty is also identical to the 
‘prediction uncertainty’ given by Equation|18[ 


The time-uncertainty at any epoch before, during, or after the 
measurement sequence can then be obtained from the error-sum: 

o-r,. = 0-2^,,, + (; o-p)^ (17) 

where j is the number of epochs relative to the centre of the se¬ 
quence. A second, simpler, way to obtain a correct prediction 
error comes from the observation that the epoch error at the be¬ 
ginning or at the end of a measurement sequence should be iden¬ 
tical. We can then use the conventional epoch error crj^^ for the 
prediction of future events by counting from the end of the se¬ 
quence, using: 

+{E- Nfaj, ;E>N, (18) 

where E is the epoch number since the begin of the sequence 
and cTp^^ is the time uncertainty at E. This equation correctly 
takes into account the duration of the measurement sequence 
(see also Fig.|^. 


4. Conclusions 

In the first part of this communication, a simple formula for the 
derivation of period errors in continuous sequences of timing 
measurements with identical timing errors has been derived and 
verified through a set of simulations. For the extraction of a lin¬ 
ear ephemeris from a set of timing measurements, which implies 
that an intrinsic constant period is assumed, there is no apparent 
reason to use other methods than a linear fit based on an error 
minimization. There is no reason to use another method for the 
estimation of the fit-parameters errors beyond an error propa¬ 
gation from the equations that determine the fit parameters (in 
this case, from Eqs[^and[^. In the case of identical measure¬ 
ment errors in a continuous series of data, equations for these 
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errors simplify to those given in Sect. 2 for the period (Eq. 0 
and Sect. 3 for the epoch (Eqs[^and 14 1 . The only point open 
to variation is the type of error minimization used in the linear 
fit, where other methods beyond chi-square minimization might 
be considered, such as minimizing absolute errors; and/or ro¬ 
bust fits that reject outliers. These may lead to slightly different 
error-estimates, all of them, however, are based on the residuals 
against the best linear best fit, independently on how that fit was 
obtained. 

It is not the aim of this communication to revise or anal¬ 
yse the ‘PEC’ algorithm of M&P, which derives the period error 
from a combination of timing errors between any pairing of two 
time measurements within a sequence. Certainly, the algorithm 
by M&P is a vastly more complicated way to derive the period 
error. Differences between the error estimates from Eq. |7]of this 
work and M&P’s ‘PEC’ - which increase with the number of 
timing measures - are probably caused by an incorrect weight¬ 
ing of the individual 2-point timing measurements from which 
PEC constructs its final result. Verifying this would, however, 
need a detailed analysis of PEC, which is beyoned of the scope 
of this study. 

Both this work and the one by M&P assume identical errors 
for all individual timing measurements. In practice, even in space 
missions such as CoRoT and Kepler, imperfect duty cycles cause 
occasional misses or incomplete transits. Eurthermore, cosmic- 
ray hits may degrade light curves of individual transits, leading 
to larger timing errors. Eor practical applications of the equa¬ 
tions presented here, occasional measurements with strongly de¬ 
viating timing errors (or missed measurements) can be ignored 
as long as a predominant timing error can be identified. If such a 
predominant error cannot be identified (e.g. owing to a change of 
integration time in Kepler light curves), or if a significant frac¬ 
tion of timing measurements is missing, the simplified equations 
presented here will not be reliable. The ephemeris and its er¬ 
rors should then be derived from a numerical least-squares mini¬ 
mization of Eq.[^ using individual timing errors (e.g. Press et al. 
1992 , sect. 14.2). 

In the second part of this article, two equations for the epoch 
error of continuous timing sequences are derived. In the first of 
these equations,the error is given for a ‘zero’ epoch that corre¬ 
sponds to the first timing measurement (Eq. [TT). This is the con¬ 
ventional way in which ephemerides are indicated. In the second 
equation, a much simpler expression is obtained for the epoch 
error at the centre of a timing sequence (Eq. [I4j l. It is shown 
that these errors are equivalent, the epoch error at the beginning 
(or end) of a timing sequence being the error sum of the cen¬ 
tral epoch error plus the period error. Ephemerides of long se¬ 
quences of timing measurements would therefore be more logi¬ 
cally expressed with epochs and epoch errors for a timing mea¬ 
surement at or near the sequence’s centre. With such a ‘central 
ephemeris’, the estimation of timing errors beyond the end of the 
original measurement sequence will then be correctly performed 
by a simple error sum between epoch error and the period er¬ 
ror. The ‘central ephemeris’ serves also to correctly estimate the 
uncertainties of the true (intrinsic) event times during the mea¬ 
surement sequence. With conventional ephemerides, on the other 
hand, a correction for the duration of the measurement sequence 
is needed in order to derive correct timing errors for predictions 
past the measurement sequence. Eor the photometric follow-up 
of CoRoT planets and planet candidates, Eq. has been im¬ 
plemented for several years in an online calculator]^ There, the 


^ http://www.iac.es/proyecto/corot/followup/ (Access will be provided 
upon request to the author) 


numbers N of observed transit events during the CoRoT point¬ 
ings are estimated from the target periods and the pointing dura¬ 
tions, ranging from 28d to 159d. Its predictions of future transit 
events have been shown to be reliable for CoRoT’s planet can¬ 
didate verification programme (described initially in Deeg et al. 
2009), as well as for an ongoing re-observation of CoRoT planet 
transits (Klagyivik et al., in prep.). The use of Eq. [TSlover naive 
error-sums of epoch and period errors (counting the epochs since 
the beginning of the measurements) is still more important when 
ephemerides of targets from the Kepler mission are considered, 
since in that case the difference between ‘naive’ and correct pre¬ 
diction errors is much larger, due to the 3.9 yr coverage of its 
light curves. 

A successful re-observation of transits or eclipses of objects 
discovered by space missions such as CoRoT or Kepler depends 
critically on the correct prediction of their transit time and tim¬ 
ing errors. Such predictions are essential when only a few hours 
are available to observe a given transit or eclipse. A best pos¬ 
sible derivation of the ephemeris errors and the prediction er¬ 
rors is therefore very important for the legacies of the space 
missions that have brought us these wonderful datasets of long, 
continuous and highly precise time series. Precise ephemeris 
measurements may also be expected to have a similar impact 
on follow-up observations of future planet detection missions, 
namely TESS (Ricker et al. 2014) and PLATO (Rauer et al. 
2014) 
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